Abstract: Upper bounds on the maximum number of codewords in a binary code of a given length and minimum Hamming distance are considered. New bounds are derived by a combination of linear programming and counting arguments. Some of these bounds improve on the best known analytic bounds. Several new record bounds are obtained for codes with small lengths.

I. Introduction 

Let $A(n,d)$ denote the maximum number of codewords in a binary code of length $n$ and minimum Hamming distance $d$. $A(n,d)$ is a basic quantity in coding theory. Lower bounds on $A(n,d)$ are obtained by constructions. For a survey of the known lower bounds the reader is referred to [9].

In this work we consider upper bounds on $A(n,d)$. The most basic upper bound on $A(n,d)$, $d = 2e+1$, is the sphere packing bound, also known as the Hamming bound:

$$A(n, 2e+1) \le \frac{2^n}{\sum_{i=0}^{e} \binom{n}{i}}. \tag{1}$$
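As a small illustration of (1), the following sketch evaluates the sphere packing bound with exact integer arithmetic. The function names `sphere_volume` and `hamming_bound` are chosen here for illustration and do not come from the paper.

```python
from math import comb


def sphere_volume(n: int, e: int) -> int:
    """Number of binary words within Hamming distance e of a fixed word."""
    return sum(comb(n, i) for i in range(e + 1))


def hamming_bound(n: int, e: int) -> int:
    """Largest integer M with M * V(n, e) <= 2**n, i.e. the bound (1)."""
    return (2 ** n) // sphere_volume(n, e)


if __name__ == "__main__":
    for n, e in [(7, 1), (21, 1), (23, 3)]:
        print(n, 2 * e + 1, hamming_bound(n, e))
```

For length 7 and distance 3, and for length 23 and distance 7, the bound is met with equality by the perfect Hamming and Golay codes.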

Johnson [8] has improved the sphere packing bound. In his theorem, Johnson used the quantity $A(n,d,w)$, which is the maximum number of codewords in a binary code of length $n$, constant weight $w$, and minimum distance $d$:

$$A(n, 2e+1) \le \frac{2^n}{\sum_{i=0}^{e} \binom{n}{i} + \dfrac{\binom{n}{e+1} - \binom{2e+1}{e} A(n, 2e+2, 2e+1)}{A(n, 2e+2, e+1)}}. \tag{2}$$
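The Johnson bound can be evaluated in the same way once the two constant weight quantities are known. The sketch below assumes the form of (2) with numerator correction $\binom{n}{e+1} - \binom{2e+1}{e} A(n,2e+2,2e+1)$ divided by $A(n,2e+2,e+1)$. The constant weight values are passed in by the caller; the function name and the sample values $A(11,4,3) = 17$ and $A(11,4,2) = 5$ are illustrative inputs, not data from this paper.

```python
from fractions import Fraction
from math import comb, floor


def johnson_bound(n: int, e: int, a_heavy: int, a_light: int) -> int:
    """Evaluate the Johnson bound (2).

    a_heavy stands for A(n, 2e+2, 2e+1) and a_light for A(n, 2e+2, e+1);
    both must be supplied by the caller (assumed known values).
    """
    volume = sum(comb(n, i) for i in range(e + 1))
    correction = Fraction(comb(n, e + 1) - comb(2 * e + 1, e) * a_heavy, a_light)
    return floor(Fraction(2 ** n) / (volume + correction))


if __name__ == "__main__":
    # e = 1, n = 11: A(11, 4, 3) = 17 and A(11, 4, 2) = floor(11 / 2) = 5.
    print(johnson_bound(11, 1, 17, 5))
    # Sphere packing bound for comparison.
    print(2 ** 11 // (1 + 11))
```

When the correction term vanishes, as for $n = 7$, $e = 1$ with $A(7,4,3) = 7$, the Johnson bound coincides with the sphere packing bound.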

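In [11] a new bound was obtained:

$$A(n, 2e+1) \le \frac{2^n}{\sum_{i=0}^{e} \binom{n}{i} + \dfrac{\binom{n+1}{e+2} - \binom{2e+2}{e+2} A(n+1, 2e+2, 2e+2)}{A(n+1, 2e+2, e+2)}}. \tag{3}$$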

This bound is at least as good as the Johnson bound for all 
values of n and d, and for each d there are infinitely many 
values of n for which the new bound is better than the Johnson 
bound. 

When specific, relatively small values of $n$ and $d$ are given, the best method for finding an upper bound on $A(n,d)$ is usually the linear programming (LP) bound. A summary of this method and some new upper bounds appeared in [11]. However, the computation of this bound is not tractable for large values of $n$. In this work we present new upper bounds on $A(n, 2e+1)$, $e \ge 1$.

Let $\mathbb{F}_2 = \{0,1\}$ and let $\mathbb{F}_2^n$ denote the set of all binary words of length $n$. For $x, y \in \mathbb{F}_2^n$, $d(x,y)$ denotes the Hamming distance between $x$ and $y$. Given $x, y \in \mathbb{F}_2^n$ such that $d(x,y) = k$, we denote by $p^k_{i,j}$ the number of words $z \in \mathbb{F}_2^n$ such that $d(x,z) = i$ and $d(z,y) = j$. This number is independent of the choice of $x$ and $y$ and is equal to

$$p^k_{i,j} = \begin{cases} \dbinom{k}{\frac{i-j+k}{2}} \dbinom{n-k}{\frac{i+j-k}{2}} & \text{if } i+j-k \text{ is even,} \\ 0 & \text{if } i+j-k \text{ is odd.} \end{cases}$$

If $x = y$, then $p^0_{i,j} = \delta_{i,j} v_i$, where $v_i = \binom{n}{i}$ is the number of words at distance $i$ from $x \in \mathbb{F}_2^n$, and $\delta_{i,j} = 1$ if $i = j$ and zero otherwise. We also denote $v = 2^n$. The $p^k_{i,j}$'s are the intersection numbers of the Hamming scheme and $v_i$ is the valency of the relation $R_i$. For the connection between association schemes and coding theory the reader is referred to [6], [10, Chapter 21].
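The closed form for the intersection numbers of the Hamming scheme can be checked by exhaustive enumeration for short lengths. The sketch below is only an illustration; the names `intersection_number` and `brute_force_count` are introduced here and are not part of the paper.

```python
from itertools import product
from math import comb


def intersection_number(n: int, k: int, i: int, j: int) -> int:
    """Closed form for p^k_{i,j} in the binary Hamming scheme of length n."""
    if (i + j - k) % 2:
        return 0
    inside = (i - j + k) // 2   # flips among the k positions where x and y differ
    outside = (i + j - k) // 2  # flips among the n - k positions where they agree
    if inside < 0 or outside < 0:
        return 0
    return comb(k, inside) * comb(n - k, outside)


def brute_force_count(n: int, k: int, i: int, j: int) -> int:
    """Count words z with d(x, z) = i and d(z, y) = j for one pair with d(x, y) = k."""
    x = (0,) * n
    y = (1,) * k + (0,) * (n - k)

    def dist(u, w):
        return sum(a != b for a, b in zip(u, w))

    return sum(1 for z in product((0, 1), repeat=n)
               if dist(x, z) == i and dist(z, y) == j)


if __name__ == "__main__":
    n = 5
    ok = all(intersection_number(n, k, i, j) == brute_force_count(n, k, i, j)
             for k in range(n + 1) for i in range(n + 1) for j in range(n + 1))
    print("closed form matches enumeration:", ok)
```

Summing $p^k_{i,j}$ over $j$ recovers the valency $v_i = \binom{n}{i}$, which gives a second quick consistency check.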

An $(n, M, 2e+1)$ code $C$ is a nonempty subset of $\mathbb{F}_2^n$ of cardinality $M$ and minimum Hamming distance $2e+1$. For a word $x \in \mathbb{F}_2^n$, $d(x, C)$ is the Hamming distance between $x$ and $C$, i.e., $d(x, C) = \min_{c \in C} d(x, c)$. A word $h \in \mathbb{F}_2^n$ is called a hole if $d(h, C) > e$, and $H = \{h \in \mathbb{F}_2^n : d(h, C) > e\}$ is the set of all holes. Clearly, we have

$$|H| = v - |C| V(n, e), \tag{4}$$

where $V(n, e) = \sum_{i=0}^{e} v_i$ is the volume of a sphere of radius $e$. The distance distribution of $C$ is defined as the sequence $A_i = |\{(c_1, c_2) \in C \times C : d(c_1, c_2) = i\}| / |C|$ for $0 \le i \le n$, and $A_i(c)$ denotes the number of codewords at distance $i$ from $c \in C$. We also define the (non-normalized) holes distance distribution $\{D_i\}_{i=0}^{n}$ by $D_i = |\{(h_1, h_2) \in H \times H : d(h_1, h_2) = i\}|$, and $D_i(h)$ denotes the number of holes at distance $i$ from $h \in H$. Finally, we define $NC(h, C, \lambda)$ to be the number of codewords of $C$ at distance $\lambda$ from a hole $h$.
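The definitions of holes and of the distance distribution can be made concrete on a toy code. The sketch below uses a three-word code of length 6 and minimum distance 3 (so $e = 1$), chosen here purely for illustration, and checks the hole count (4) by enumeration.

```python
from fractions import Fraction
from itertools import product
from math import comb


def hamming(u, w):
    """Hamming distance between two equal-length tuples."""
    return sum(a != b for a, b in zip(u, w))


def find_holes(code, n, e):
    """All words whose distance to the code exceeds e."""
    return [x for x in product((0, 1), repeat=n)
            if min(hamming(x, c) for c in code) > e]


def distance_distribution(code, n):
    """A_i = (number of ordered codeword pairs at distance i) / |C|."""
    counts = [0] * (n + 1)
    for c1 in code:
        for c2 in code:
            counts[hamming(c1, c2)] += 1
    return [Fraction(count, len(code)) for count in counts]


# Illustrative toy code (not from the paper): length 6, minimum distance 3.
n, e = 6, 1
code = [(0, 0, 0, 0, 0, 0), (1, 1, 1, 0, 0, 0), (0, 0, 0, 1, 1, 1)]
holes = find_holes(code, n, e)
volume = sum(comb(n, i) for i in range(e + 1))

if __name__ == "__main__":
    print("holes by enumeration:", len(holes))
    print("v - |C| V(n, e):     ", 2 ** n - len(code) * volume)
    print("distance distribution:", distance_distribution(code, n))
```

Because spheres of radius $e$ around distinct codewords are disjoint, the enumeration and formula (4) agree for any code of minimum distance $2e+1$.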

II. Holes Distance Distribution 

In the first theorem we state that for a given $(n, M, 2e+1)$ code $C$, the distance distribution of the holes is uniquely determined by the distance distribution $\{A_i\}_{i=0}^{n}$ of the code $C$.

Theorem 1: If $C$ is an $(n, M, 2e+1)$ code with distance distribution $\{A_i\}_{i=0}^{n}$, then

$$D_i = v v_i + |C| \big( \eta(C, i) - 2 V(n, e) v_i \big)$$

for each $i$, $0 \le i \le n$, where

$$\eta(C, i) = \sum_{k=0}^{n} A_k \sum_{l=0}^{e} \sum_{m=0}^{e} \sum_{j=0}^{n} p^k_{l,j}\, p^j_{i,m}. \tag{5}$$
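Theorem 1 can be checked numerically on a small code. The sketch below assumes that (5) is read as the inclusion-exclusion count $\eta(C,i) = \sum_k A_k \sum_{l,m \le e} \sum_j p^k_{l,j} p^j_{i,m}$, that is, the number of pairs of covered words at distance $i$ per codeword. That reading is an assumption made for this illustration, and the toy code and all function names are hypothetical.

```python
from itertools import product
from math import comb


def hamming(u, w):
    return sum(a != b for a, b in zip(u, w))


def p(n, k, i, j):
    """Intersection number p^k_{i,j} of the binary Hamming scheme."""
    if (i + j - k) % 2:
        return 0
    a, b = (i - j + k) // 2, (i + j - k) // 2
    return comb(k, a) * comb(n - k, b) if a >= 0 and b >= 0 else 0


def eta_times_size(code, n, e, i):
    """|C| * eta(C, i), using integer pair counts N_k = |C| * A_k."""
    pair_counts = [0] * (n + 1)
    for c1 in code:
        for c2 in code:
            pair_counts[hamming(c1, c2)] += 1
    total = 0
    for k, n_k in enumerate(pair_counts):
        if n_k:
            total += n_k * sum(p(n, k, l, j) * p(n, j, i, m)
                               for l in range(e + 1)
                               for m in range(e + 1)
                               for j in range(n + 1))
    return total


def holes_distribution_formula(code, n, e):
    """D_i = v v_i + |C| (eta(C, i) - 2 V(n, e) v_i) for i = 0..n."""
    volume = sum(comb(n, t) for t in range(e + 1))
    return [2 ** n * comb(n, i) + eta_times_size(code, n, e, i)
            - 2 * len(code) * volume * comb(n, i) for i in range(n + 1)]


def holes_distribution_brute_force(code, n, e):
    """Count ordered pairs of holes at each distance by enumeration."""
    holes = [x for x in product((0, 1), repeat=n)
             if min(hamming(x, c) for c in code) > e]
    counts = [0] * (n + 1)
    for h1 in holes:
        for h2 in holes:
            counts[hamming(h1, h2)] += 1
    return counts


if __name__ == "__main__":
    toy = [(0, 0, 0, 0, 0, 0), (1, 1, 1, 0, 0, 0), (0, 0, 0, 1, 1, 1)]
    print(holes_distribution_formula(toy, 6, 1))
    print(holes_distribution_brute_force(toy, 6, 1))
```

The two printed lists should coincide; the entry for $i = 0$ is the number of holes, which reproduces (4).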

Corollary 1: Let $C$ be an $(n, M, 2e+1)$ code with distance distribution $\{A_i\}_{i=0}^{n}$. If $\{q_i\}_{i=0}^{n}$ is a sequence of real numbers, then

$$\sum_{i=0}^{n} q_i D_i = v \sum_{i=0}^{n} q_i v_i + |C| \sum_{i=0}^{n} q_i \big( \eta(C, i) - 2 V(n, e) v_i \big). \tag{6}$$

By using Corollary 1, for any given sequence $\{q_i\}_{i=0}^{n}$ we obtain a linear combination of the $D_i$'s. By finding a lower bound on this combination we can obtain an upper bound on the size of $C$.

Example 1: Let $q_0 = 1$ and $q_i = 0$, $i > 0$. Clearly $v_0 = 1$ and by (5) we have $\eta(C, 0) = V(n, e)$. After substituting the trivial bound $D_0 \ge 0$ to (6) we obtain the sphere packing bound (1).

The sequence $\{q_i\}_{i=0}^{n}$ of Corollary 1 will be called the holes distance indices (HDI) sequence. For convenience, in the rest of the paper we will write $\{q_i\}$ instead of $\{q_i\}_{i=0}^{n}$. In the next two sections we will find some good HDI sequences $\{q_i\}$ and develop methods to find lower bounds on $\sum_{i=0}^{n} q_i D_i$.

III. HDI Sequences with Small Indices

In this section we consider HDI sequences, where nonzero $q_i$'s correspond to small indices. The following lemma gives an alternative expression for $D_i$.

Lemma 1: For each $i$, $0 \le i \le n$,

$$D_i = \sum_{h \in H} \Big( v_i - \sum_{k=e+1}^{e+i} NC(h, C, k) \sum_{j=0}^{e} p^k_{i,j} \Big). \tag{7}$$

Given a sequence $\{q_i\}$, by using Lemma 1 and (4) we estimate $\sum_{i=0}^{n} q_i D_i$ in the following way:

$$\sum_{i=0}^{n} q_i D_i = \sum_{h \in H} \Big( \sum_{i=0}^{n} q_i v_i - \sum_{i=0}^{n} q_i \sum_{k=e+1}^{e+i} NC(h, C, k) \sum_{j=0}^{e} p^k_{i,j} \Big) \ge \big( v - |C| V(n, e) \big) \Big( \sum_{i=0}^{n} q_i v_i - S(C, \{q_i\}) \Big), \tag{8}$$

where $S(C, \{q_i\}) = \max_{h \in H} \big\{ \sum_{i=0}^{n} q_i \sum_{k=e+1}^{e+i} NC(h, C, k) \sum_{j=0}^{e} p^k_{i,j} \big\}$.

By combining (6) and (8) we obtain

Theorem 2: If $C$ is an $(n, M, 2e+1)$ code with distance distribution $\{A_i\}_{i=0}^{n}$, then

$$|C| \le \frac{2^n}{V(n, e) + \dfrac{\sum_{i=0}^{n} q_i \big( V(n, e) v_i - \eta(C, i) \big)}{S(C, \{q_i\})}},$$

provided $S(C, \{q_i\})$ is not zero, where $S(C, \{q_i\})$ is given by (8) and $\eta(C, i)$ is given by (5).

Example 2: Let $q_1 = 1$ and $q_i = 0$ for $i \ne 1$. From (5) and (8) we have

$$\eta(C, 1) = V(n, e) v_1 - p^{e+1}_{1,e} v_{e+1} + p^{e+1}_{1,e}\, p^{2e+1}_{e+1,e} A_{2e+1}$$

and

$$S(C, \{q_i\}) = p^{e+1}_{1,e} \max_{h \in H} \{ NC(h, C, e+1) \}.$$

Thus, using Theorem 2 we obtain

Theorem 3: If $C$ is an $(n, M, 2e+1)$ code with distance distribution $\{A_i\}_{i=0}^{n}$, then

$$|C| \le \frac{2^n}{V(n, e) + \dfrac{v_{e+1} - p^{2e+1}_{e+1,e} A_{2e+1}}{\max_{h \in H} \{ NC(h, C, e+1) \}}}. \tag{9}$$

By substituting

$$A_{2e+1} \le A(n, 2e+2, 2e+1)$$

and

$$\max_{h \in H} \{ NC(h, C, e+1) \} \le A(n, 2e+2, e+1)$$

in (9) we obtain the Johnson upper bound (2).

Example 3: Let

$$q_1 = \frac{p^{e+2}_{2,e} - p^{e+1}_{2,e-1}}{p^{e+1}_{1,e}}, \quad q_2 = 1,$$

and $q_i = 0$ for $i \notin \{1, 2\}$. From (5) and (8) we have

$$q_1 \eta(C, 1) + q_2 \eta(C, 2) = V(n, e)(q_1 v_1 + q_2 v_2) - p^{e+2}_{2,e} \Big( v_{e+1} + v_{e+2} - \big( p^{2e+1}_{e+1,e} + p^{2e+1}_{e+2,e-1} + p^{2e+1}_{e+2,e} \big) A_{2e+1} - p^{2e+2}_{e+2,e} A_{2e+2} \Big)$$

and

$$S(C, \{q_i\}) = p^{e+2}_{2,e} \max_{h \in H} \{ NC(h, C, e+1) + NC(h, C, e+2) \}.$$

Thus, using Theorem 2 we obtain



Theorem 4: If $C$ is an $(n, M, 2e+1)$ code with distance distribution $\{A_i\}_{i=0}^{n}$, then

$$|C| \le \frac{2^n}{V(n, e) + \dfrac{v_{e+1} + v_{e+2} - \big( p^{2e+1}_{e+1,e} + p^{2e+1}_{e+2,e-1} + p^{2e+1}_{e+2,e} \big) A_{2e+1} - p^{2e+2}_{e+2,e} A_{2e+2}}{\max_{h \in H} \{ NC(h, C, e+1) + NC(h, C, e+2) \}}}. \tag{10}$$

By substituting

$$A_{2e+1} + A_{2e+2} \le A(n+1, 2e+2, 2e+2)$$

and

$$\max_{h \in H} \{ NC(h, C, e+1) + NC(h, C, e+2) \} \le A(n+1, 2e+2, e+2)$$

in (10) we obtain the bound of (3).

Next, we want to improve the trivial bound on $A_i$ given by $A_i \le A(n, 2e+2, i)$. We will find upper bounds on the distance distribution coefficients $A_i$ using linear programming. For an $(n, M, 2e+1)$ code $C$ with distance distribution $\{A_i\}_{i=0}^{n}$ let us denote by $LP[n, 2e+1]$ the following system of Delsarte's linear constraints:

$$\sum_{i=0}^{n} A_i P_k(i) \ge 0 \quad \text{for } 0 \le k \le n,$$
$$0 \le A_i \le A(n, 2e+2, i) \quad \text{for } i = 2e+1, 2e+2, \ldots, n,$$
$$A_0 = 1, \; A_i = 0 \quad \text{for } 1 \le i < 2e+1,$$

where $P_k(i) = \sum_{j=0}^{k} (-1)^j \binom{i}{j} \binom{n-i}{k-j}$ denotes the Krawtchouk polynomial of degree $k$. We also denote $\hat{n} = n+1$ and let $\{\hat{A}_i\}_{i=0}^{\hat{n}}$ be the distance distribution of the $(n+1, M, 2e+2)$ extended code $C_e$ which is obtained from the $(n, M, 2e+1)$ code $C$ with distance distribution $\{A_i\}_{i=0}^{n}$ by adding an even parity bit to each codeword of $C$. It is easy to verify that for each $i$, $e+1 \le i \le \lfloor \hat{n}/2 \rfloor$,

$$\hat{A}_{2i} = A_{2i-1} + A_{2i}. \tag{11}$$

For the even weight code $C_e$ of length $\hat{n}$ and distance $d = 2e+2$ we denote by $LP_e[\hat{n}, 2e+2]$ the following system of Delsarte's linear constraints:

$$\sum_{i=0}^{\hat{n}} \hat{A}_i P_k(i) \ge 0 \quad \text{for } 0 \le k \le \lfloor \hat{n}/2 \rfloor,$$
$$0 \le \hat{A}_i \le A(\hat{n}, d, i) \quad \text{for } i = 2e+2, 2e+4, \ldots, 2\lfloor \hat{n}/2 \rfloor,$$
$$\hat{A}_0 = 1, \; \hat{A}_i = 0 \quad \text{for } 1 \le i < 2e+2.$$

In some cases we will add more constraints to obtain some specific bounds as in [5], [7], [11], [12].

By Theorem 4 we have that for an $(n, M, 2e+1)$ code $C$ with distance distribution $\{A_i\}_{i=0}^{n}$ the following holds:

$$|C| \le \frac{2^n}{\sum_{i=0}^{e} \binom{n}{i} + \dfrac{\binom{n+1}{e+2} - \binom{2e+2}{e+2} (A_{2e+1} + A_{2e+2})}{A(n+1, 2e+2, e+2)}}.$$

Using (11) we obtain

Theorem 5:

$$A(n, 2e+1) \le \frac{2^n}{\sum_{i=0}^{e} \binom{n}{i} + \dfrac{\binom{n+1}{e+2} - \binom{2e+2}{e+2} \max\{\hat{A}_{2e+2}\}}{A(n+1, 2e+2, e+2)}},$$

where $\max\{\hat{A}_{2e+2}\}$ is taken subject to $LP_e[\hat{n}, 2e+2]$.

For the next result we need the following theorem, which is a generalization of a theorem given by Best [3].



Theorem 6: Let $C$ be a code of length $n$, minimum Hamming distance $d$, and distance distribution $\{A_i\}_{i=0}^{n}$. Let $\{p_i\}_{i=0}^{n}$ be a sequence of real numbers. Then there exists a code $C'$ of length $n-1$ and distance $d$ with distance distribution $\{A'_i\}_{i=0}^{n-1}$, satisfying

$$\sum_{i=0}^{n} (n - i)\, p_i A_i \le n \sum_{i=0}^{n-1} p_i A'_i. \tag{12}$$

It was proved in [13] by using LP that for an even weight code $C$ of length $n \equiv 1 \pmod 4$, distance $d = 4$, and distance distribution $\{A_i\}_{i=0}^{n}$,

$$A_4 \le \frac{(n-1)(n-2)(n-3)}{24}. \tag{13}$$

We substitute $p_i = \delta_{i,4}$ in (12) and use (13) as the upper bound on $A'_4$ to obtain

Lemma 2: If $C$ is an even weight code of length $\hat{n} \equiv 2 \pmod 4$, distance $d = 4$, and distance distribution $\{\hat{A}_i\}_{i=0}^{\hat{n}}$, then

$$\hat{A}_4 \le \frac{\hat{n}(\hat{n}-2)(\hat{n}-3)}{24}.$$

We take $e = 1$ and $n \equiv 9 \pmod{12}$. Since $A(n+1, 4, 3) = (n^2 - 3)/6$ for $n \equiv 9 \pmod{12}$ [10, p. 529], it follows by Lemma 2 and Theorem 5 that

Theorem 7: For $n \equiv 9 \pmod{12}$,

$$A(n, 3) \le \frac{2^n}{n + 3 + \dfrac{4}{n^2 - 3}}.$$

The previous best known bound $A(n, 3) \le 2^n/(n+3)$ for $n \equiv 1 \pmod 4$ was obtained in [4] by LP. In particular, we have $A(21, 3) \le 87348$, which improves on the previous best known bound $A(21, 3) \le 87376$ [11].
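The value 87348 quoted for $n = 21$ can be reproduced with exact rational arithmetic. The sketch below assumes that the bound of Theorem 5 for $e = 1$ has denominator $1 + n + \big(\binom{n+1}{3} - \binom{4}{3} \max\{\hat{A}_4\}\big)/A(n+1,4,3)$, and substitutes the Lemma 2 estimate $\hat{A}_4 \le \hat{n}(\hat{n}-2)(\hat{n}-3)/24$ with $\hat{n} = n+1$ together with $A(n+1,4,3) = (n^2-3)/6$. The function name is hypothetical.

```python
from fractions import Fraction
from math import comb, floor


def theorem7_bound(n: int) -> int:
    """Upper bound on A(n, 3) for n congruent to 9 modulo 12 (illustrative sketch)."""
    if n % 12 != 9:
        raise ValueError("this sketch only covers n = 9 (mod 12)")
    n_hat = n + 1
    # Lemma 2 estimate for the weight-4 coefficient of the extended code.
    a4_max = Fraction(n_hat * (n_hat - 2) * (n_hat - 3), 24)
    # A(n + 1, 4, 3) = (n^2 - 3) / 6 for these lengths.
    a_const_weight = Fraction(n * n - 3, 6)
    correction = (comb(n + 1, 3) - comb(4, 3) * a4_max) / a_const_weight
    return floor(Fraction(2 ** n) / (1 + n + correction))


if __name__ == "__main__":
    print(theorem7_bound(21))    # compare with the value quoted in the text
    print(2 ** 21 // (21 + 3))   # the earlier bound 2^n / (n + 3)
```

The correction term simplifies to $2 + 4/(n^2 - 3)$, so the gain over $2^n/(n+3)$ comes entirely from the extra $4/(n^2-3)$ in the denominator.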

IV. HDI Sequences with Large Indices 

We demonstrate another approach to estimating $\sum_{i=0}^{n} q_i D_i$, where the nonzero elements of $\{q_i\}$ correspond to large indices. For each $t$, $0 \le t \le e$, we denote

$$E_{n-t} = \{ h \in H : NC(h, C, n-t) = 1 \}.$$

Note that for any hole $h \in H$ we have $NC(h, C, n-t) \in \{0, 1\}$, where $0 \le t \le e$.

Lemma 3: For each $t$, $0 \le t \le e$,

$$|E_{n-t}| = |C| \Big( v_{n-t} - \sum_{i=0}^{e+t} A_{n-i} \sum_{j=0}^{e} p^{n-i}_{n-t,j} \Big). \tag{14}$$

Let $q_{n-1} = q_n = 1$ and $q_i = 0$ for $i \notin \{n-1, n\}$. If $h \in E_{n-t}$ for $t \in \{0, 1, \ldots, e-1\}$, then

$$D_{n-1}(h) + D_n(h) \ge 0. \tag{15}$$

If $h \in E_{n-e}$, then

$$D_{n-1}(h) + D_n(h) \ge n - e - (e+1) A(n-e, 2e+2, e+1) = n - e - (e+1) \left\lfloor \frac{n-e}{e+1} \right\rfloor. \tag{16}$$

If for a given hole $h$ there exists no codeword at distance $k \in \{n-e, n-(e-1), \ldots, n-1, n\}$, then

$$D_{n-1}(h) + D_n(h) \ge n + 1 - (e+1) A(n, 2e+2, e+1) = n + 1 - (e+1) \left\lfloor \frac{n}{e+1} \right\rfloor. \tag{17}$$

By combining (14)-(17) with Corollary 1 we obtain

Theorem 8:

$$A(n, 2e+1) \le \frac{2^n}{\sum_{i=0}^{e} \binom{n}{i} + \dfrac{(n+1) V(n, e) - b \sum_{t=0}^{e-1} \binom{n}{t} - (b - a) \binom{n}{e} - W(n)}{(e+1) \left\lfloor \frac{n}{e+1} \right\rfloor}},$$

where $b = n + 1 - (e+1) \lfloor \frac{n}{e+1} \rfloor$, $b - a = (e+1) \big( 1 + \lfloor \frac{n-e}{e+1} \rfloor - \lfloor \frac{n}{e+1} \rfloor \big)$, and

$$W(n) = \max \Big\{ \eta(C, n-1) + \eta(C, n) - b \sum_{t=0}^{e-1} \sum_{i=0}^{e+t} A_{n-i} \sum_{j=0}^{e} p^{n-i}_{n-t,j} - (b - a) \sum_{i=0}^{2e} A_{n-i} \sum_{j=0}^{e} p^{n-i}_{n-e,j} \Big\}$$

subject to $LP[n, 2e+1]$, and $\eta(C, n-1)$, $\eta(C, n)$ are given by (5).

By Theorem 8 we obtain $A(22, 3) \le 172361$ and $A(24, 5) \le 47538$, which improve the previous best known bounds $A(22, 3) \le 173015$ [11] and $A(24, 5) \le 48008$ [14].

Let $n$ be an even integer and let $e = 1$. By Theorem 8 and (11) we obtain

Theorem 9: If $n$ is an even integer, then

$$A(n, 3) \le \frac{2^n}{2n + 3 - \dfrac{W(n)}{n}},$$

where

$$W(n) = \max \{ 6 \hat{A}_{n-2} + 3n \hat{A}_n \}$$

subject to $LP_e[\hat{n}, 4]$.

Using LP we can prove the following lemma.

Lemma 4: If $C$ is an even weight code of length $\hat{n} \equiv 11 \pmod{12}$, distance $d = 4$, and distance distribution $\{\hat{A}_i\}_{i=0}^{\hat{n}}$, then

[inequality illegible in the source]

Therefore, by Theorem 9 and Lemma 4 we have

Theorem 10: For $n \equiv 10 \pmod{12}$,

$$A(n, 3) \le \; [\text{formula illegible in the source}]$$

The previous best known analytic bound for these lengths [formula illegible in the source] was obtained by (3).

By similar arguments, if $\{q_i\}$ is a sequence with $q_i = 0$, except for $q_{n-2} = q_{n-1} = q_n = 1$, we obtain the following bound.

Theorem 11: If $C$ is an $(n, M, 2e+1)$ code, then

$$|C| \le \; [\text{formula illegible in the source}]$$

where $W(n)$ is the maximum of an expression involving $\sum_{i=n-2}^{n} \eta(C, i)$ [expression illegible in the source], taken subject to $LP[n, 2e+1]$, and $\eta(C, n-2)$, $\eta(C, n-1)$, $\eta(C, n)$ are given by (5).

Applying Theorem 11 we obtain $A(21, 3) \le 87333$, which is better than the best previously known bound (see Section III).
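The counting argument behind the sets $E_{n-t}$ in this section can be checked by enumeration. The sketch below assumes that Lemma 3 reads $|E_{n-t}| = |C|\big(v_{n-t} - \sum_k A_k \sum_{j=0}^{e} p^k_{n-t,j}\big)$, where only the terms with $k \ge n-t-e$ contribute. The toy code and the function names are illustrative assumptions.

```python
from itertools import product
from math import comb


def hamming(u, w):
    return sum(a != b for a, b in zip(u, w))


def p(n, k, i, j):
    """Intersection number p^k_{i,j} of the binary Hamming scheme."""
    if (i + j - k) % 2:
        return 0
    a, b = (i - j + k) // 2, (i + j - k) // 2
    return comb(k, a) * comb(n - k, b) if a >= 0 and b >= 0 else 0


def far_holes_brute_force(code, n, e, t):
    """Number of holes having exactly one codeword at distance n - t."""
    count = 0
    for x in product((0, 1), repeat=n):
        if min(hamming(x, c) for c in code) <= e:
            continue  # x is covered by a sphere, so it is not a hole
        nc = sum(1 for c in code if hamming(x, c) == n - t)
        assert nc in (0, 1)  # two such codewords would be within distance 2e
        count += nc
    return count


def far_holes_formula(code, n, e, t):
    """|C| v_{n-t} minus the number of (codeword, covered word) pairs at distance n - t."""
    pair_counts = [0] * (n + 1)
    for c1 in code:
        for c2 in code:
            pair_counts[hamming(c1, c2)] += 1
    covered = sum(n_k * sum(p(n, k, n - t, j) for j in range(e + 1))
                  for k, n_k in enumerate(pair_counts))
    return len(code) * comb(n, n - t) - covered


if __name__ == "__main__":
    toy = [(0, 0, 0, 0, 0, 0), (1, 1, 1, 0, 0, 0), (0, 0, 0, 1, 1, 1)]
    for t in (0, 1):
        print(t, far_holes_brute_force(toy, 6, 1, t), far_holes_formula(toy, 6, 1, t))
```

For this toy code the only hole at distance 6 from a codeword is the all-ones word, and its six neighbours are the holes at distance 5 from the all-zero codeword.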



V. Generalization for Arbitrary Metric 
Association Schemes 

We can generalize our approach to an arbitrary metric association scheme $(X, \mathcal{R})$ with distance function $d$, which consists of a finite set $X$ together with a set $\mathcal{R}$ of $n+1$ relations defined on $X$ with certain properties. For the complete definition and a brief introduction to association schemes, the reader is referred to [10, Chapter 21]. We extend the definitions from the first section as follows: $|X| = v$ is the number of points of the finite set $X$, $v_i$ is the valency of the relation $R_i$, and the $p^k_{i,j}$'s are the intersection numbers of the scheme. A code $C$ is a nonempty subset of $X$ with minimum distance $2e+1$. The definitions related to holes and distance distribution are easily generalized. The results of (4), Lemma 1, Theorems 2 through 4, and Corollary 1 are valid for arbitrary metric association schemes.

As an example we consider the Johnson scheme. In this scheme $X$ is the set of all binary vectors of length $n$ and weight $w$. Note that in this scheme the number of relations is $w+1$ and $n$ has a different meaning. The distance between two vectors is defined to be half of the Hamming distance between them. One can verify that $v = \binom{n}{w}$, $v_i = \binom{w}{i} \binom{n-w}{i}$, and $p^k_{i,j}$ is given by

$$p^k_{i,j} = \sum_{l=0}^{w-k} \binom{w-k}{l} \binom{k}{w-i-l} \binom{k}{w-j-l} \binom{n-w-k}{i+j+l-w}.$$

Denote by $T(w_1, n_1, w_2, n_2, d)$ the maximum number of binary vectors of length $n_1 + n_2$, having mutual Hamming distance of at least $d$, where each vector has exactly $w_1$ ones in the first $n_1$ coordinates and exactly $w_2$ ones in the last $n_2$ coordinates. By substituting

$$\max_{h \in H} \{ NC(h, C, e+1) \} \le T(e+1, w, e+1, n-w, 4e+2)$$

in (9), we obtain the following bound.

Theorem 12:

$$A(n, 4e+2, w) \le \frac{\binom{n}{w}}{\sum_{i=0}^{e} \binom{w}{i} \binom{n-w}{i} + \dfrac{\binom{w}{e+1} \binom{n-w}{e+1} - \binom{2e+1}{e}^2 U_w(n)}{T(e+1, w, e+1, n-w, 4e+2)}},$$

where

$$U_w(n) = \max \{ A_{2e+1} \}$$

subject to Delsarte's linear constraints for the Johnson scheme (see [10, Theorem 12, p. 666]).

Applying Theorem 12 for $e = 1$ we obtain the following improvements (the values in parentheses are the best bounds previously known [1], [14]): $A(19, 6, 7) \le 519$ (520), $A(22, 6, 11) \le 5033$ (5064), $A(26, 6, 11) \le 42017$ (42080).

We would like to remark that LP can also be applied to upper bounds that are obtained by centering spheres around codewords. We give an example of such a bound.

Theorem 13:

$$A(n, 10, w) \le \; [\text{formula illegible in the source; it involves } T(3, w, 3, n-w, 10) \text{ and } T(4, w, 4, n-w, 10)]$$

where $U_w(n)$ is a maximum [expression illegible in the source] taken subject to Delsarte's linear constraints for the Johnson scheme.

By Theorem 13 we have: $A(23, 10, 9) \le 78$ (81), $A(24, 10, 9) \le 116$ (119), $A(25, 10, 9) \le 157$ (158), $A(27, 10, 9) \le 293$ (299), $A(28, 10, 10) \le 785$ (821).
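The Johnson scheme parameters used in this section can be checked by enumeration for small $n$ and $w$. The sketch below assumes the standard counting formula $p^k_{i,j} = \sum_{l} \binom{w-k}{l} \binom{k}{w-i-l} \binom{k}{w-j-l} \binom{n-w-k}{i+j+l-w}$, where $l$ counts the coordinates shared by all three vectors; this form and the function names are assumptions made for the illustration.

```python
from itertools import combinations
from math import comb, prod


def johnson_valency(n: int, w: int, i: int) -> int:
    """Number of weight-w vectors at Johnson distance i from a fixed one."""
    return comb(w, i) * comb(n - w, i)


def johnson_intersection(n: int, w: int, k: int, i: int, j: int) -> int:
    """Assumed closed form for p^k_{i,j} in the Johnson scheme J(n, w)."""
    total = 0
    for l in range(w - k + 1):
        parts = [(w - k, l), (k, w - i - l), (k, w - j - l), (n - w - k, i + j + l - w)]
        if all(top >= 0 and bottom >= 0 for top, bottom in parts):
            total += prod(comb(top, bottom) for top, bottom in parts)
    return total


def brute_force(n: int, w: int, k: int, i: int, j: int) -> int:
    """Enumerate supports z with distance i to x and j to y, where d(x, y) = k."""
    x = frozenset(range(w))
    y = frozenset(range(k, w + k))  # shares w - k coordinates with x; needs w + k <= n
    return sum(1 for z in map(frozenset, combinations(range(n), w))
               if len(x - z) == i and len(z - y) == j)


if __name__ == "__main__":
    n, w = 7, 3
    ok = all(johnson_intersection(n, w, k, i, j) == brute_force(n, w, k, i, j)
             for k in range(w + 1) for i in range(w + 1) for j in range(w + 1))
    print("closed form matches enumeration:", ok)
    # For e = 1 the coefficient of A_{2e+1} in Theorem 12 is p^3_{2,1}.
    print(johnson_intersection(n, w, 3, 2, 1), comb(3, 1) ** 2)
```

The last line illustrates why the square $\binom{2e+1}{e}^2$ appears in Theorem 12: in the Johnson scheme $p^{2e+1}_{e+1,e}$ factors into one binomial coefficient for each of the two coordinate blocks.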